Extracting Social Networks from Literary Text with Word Embedding Tools

نویسندگان

  • Gerhard Wohlgenannt
  • Ekaterina Chernyak
  • Dmitry I. Ilvovsky
چکیده

In this paper a social network is extracted from a literary text. The social network shows, how frequent the characters interact and how similar their social behavior is. Two types of similarity measures are used: the first applies co-occurrence statistics, while the second exploits cosine similarity on different types of word embedding vectors. The results are evaluated by a paid micro-task crowdsourcing survey. The experiments suggest that specific types of word embeddings like word2vec are well-suited for the task at hand and the specific circumstances of literary fiction text.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Extracting Networks of People and Places from Literary Texts

We describe a method to automatically extract social networks from literary texts. Similar to those in prior research, nodes represent characters found in the texts; edges connect them to other characters with whom they interact, and also display sentences describing their interactions. Furthermore, other nodes encode places and are connected to characters who were active there. Thus, these net...

متن کامل

Quote Attribution for Literary Text with Neural Networks

We propose a method for using neural networks to attribute quotes in literary texts. Since previous work has been unable to successfully solve this problem based on bag-of-words features, we study the issue of whether this is due to the limited expressiveness of such features. By re-framing the modeling of quotes and characters as based off of word vectors, we hope to demonstrate that individua...

متن کامل

Exploring the Betrothed Lovers

We present the ongoing activities and the first results achieved in a research project concerning the understanding of narrative in the high school. Students and teachers experimented with new ways to learn linguistic and digital skills, by using a collaborative learning environment built around the novel I Promessi Sposi. We analyzed the literary text, extracting social networks of characters ...

متن کامل

Using word embedding for bio-event extraction

Bio-event extraction is an important phase towards the goal of extracting biological networks from the scientific literature. Recent advances in word embedding make computation of word distribution more efficient and possible. In this study, we investigate methods bringing distributional characteristics of words in the text into event extraction by using the latest word embedding methods. By us...

متن کامل

The Actor-Topic Model for Extracting Social Networks in Literary Narrative

We present a generative model for conversational dialogues, namely the actortopic model (ACTM), that extend the author-topic model (Rosen-Zvi, et.al, 2004) to identify actors of given conversation in literary narratives. Thus ACTM assigns each instance of quoted speech to an appropriate character. We model dialogues in a literary text, which take place between two or more actors conversing on d...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016